Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

"Traiter des " masses " de données prosopographiques par la numérisation d'annuaires : entre espoirs et vertiges"

Identifieur interne : 000234 ( Main/Exploration ); précédent : 000233; suivant : 000235

"Traiter des " masses " de données prosopographiques par la numérisation d'annuaires : entre espoirs et vertiges"

Auteurs : Sylvain Laurens [France]

Source :

RBID : Hal:halshs-00747762

Descripteurs français

English descriptors

Abstract

Treating "Masses" of Prosopographical Data by Scanning Directories - Hopes and Disorientation: This note aims to provide an update on the progress made in optical character recognition (OCR) and the contribution of these techniques to the creation of prosopographical data bases in social sciences. With the example of a European investigation of European business associations, it highlights the progress made possible by OCR with the analysis of several biographical directories identifying groups of business interests. 8ased on this example, it is hypothesized that the development of digital technologies allow the creation of corpuses of data much larger th an in the past in the framework of quantitative inquiries conducted by smaller teams. However, this article also highlights the fact that this extension of corpuses - made possible by scanning - raises new problems of method, starting with the increased time devoted to the standardization of digital data

Url:


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="fr">"Traiter des " masses " de données prosopographiques par la numérisation d'annuaires : entre espoirs et vertiges"</title>
<author>
<name sortKey="Laurens, Sylvain" sort="Laurens, Sylvain" uniqKey="Laurens S" first="Sylvain" last="Laurens">Sylvain Laurens</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-74779" status="VALID">
<idno type="IdRef">165865644</idno>
<idno type="RNSR">200415075Y</idno>
<orgName>Groupe de Recherches et d'Etudes Sociologiques du Centre-Ouest</orgName>
<orgName type="acronym">GRESCO</orgName>
<date type="start">2004-01-01</date>
<desc>
<address>
<addrLine>Université de Poitiers - UFR SHA 8, rue René Descartes - TSA 81118 86073 Poitiers Cedex 9</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://gresco.labo.univ-poitiers.fr/</ref>
</desc>
<listRelation>
<relation name="EA3815" active="#struct-5928" type="direct"></relation>
<relation name="EA3815" active="#struct-54493" type="direct"></relation>
<relation name="EA3815" active="#struct-302000" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA3815" active="#struct-5928" type="direct">
<org type="institution" xml:id="struct-5928" status="VALID">
<idno type="IdRef">026403315</idno>
<idno type="ISNI">0000000121654861</idno>
<orgName>Université de Limoges</orgName>
<orgName type="acronym">UNILIM</orgName>
<date type="start">1968-10-01</date>
<desc>
<address>
<addrLine>33 rue François Mitterrand BP23204 87032 Limoges</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.unilim.fr</ref>
</desc>
</org>
</tutelle>
<tutelle name="EA3815" active="#struct-54493" type="direct">
<org type="institution" xml:id="struct-54493" status="VALID">
<orgName>Université de Poitiers</orgName>
<desc>
<address>
<addrLine>15, rue de l'Hôtel Dieu - 86034 Poitiers Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-poitiers.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="EA3815" active="#struct-302000" type="direct">
<org type="institution" xml:id="struct-302000" status="VALID">
<orgName>Institut Sciences de l'Homme et de la Société</orgName>
<orgName type="acronym">IR SHS UNILIM</orgName>
<desc>
<address>
<addrLine>5 rue Félix EbouéB.P. 312787031 Limoges Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.unilim.fr/SHS-Institut-Sciences-de-l-Homme</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Limoges</settlement>
<region type="region" nuts="2">Limousin</region>
</placeName>
<orgName type="university">Université de Limoges</orgName>
<placeName>
<settlement type="city">Poitiers</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de Poitiers</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">HAL</idno>
<idno type="RBID">Hal:halshs-00747762</idno>
<idno type="halId">halshs-00747762</idno>
<idno type="halUri">https://halshs.archives-ouvertes.fr/halshs-00747762</idno>
<idno type="url">https://halshs.archives-ouvertes.fr/halshs-00747762</idno>
<date when="2012">2012</date>
<idno type="wicri:Area/Hal/Corpus">000134</idno>
<idno type="wicri:Area/Hal/Curation">000134</idno>
<idno type="wicri:Area/Hal/Checkpoint">000072</idno>
<idno type="wicri:doubleKey">0759-1063:2012:Laurens S:traiter:des:masses</idno>
<idno type="wicri:Area/Main/Merge">000330</idno>
<idno type="wicri:Area/Main/Curation">000234</idno>
<idno type="wicri:Area/Main/Exploration">000234</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="fr">"Traiter des " masses " de données prosopographiques par la numérisation d'annuaires : entre espoirs et vertiges"</title>
<author>
<name sortKey="Laurens, Sylvain" sort="Laurens, Sylvain" uniqKey="Laurens S" first="Sylvain" last="Laurens">Sylvain Laurens</name>
<affiliation wicri:level="1">
<hal:affiliation type="laboratory" xml:id="struct-74779" status="VALID">
<idno type="IdRef">165865644</idno>
<idno type="RNSR">200415075Y</idno>
<orgName>Groupe de Recherches et d'Etudes Sociologiques du Centre-Ouest</orgName>
<orgName type="acronym">GRESCO</orgName>
<date type="start">2004-01-01</date>
<desc>
<address>
<addrLine>Université de Poitiers - UFR SHA 8, rue René Descartes - TSA 81118 86073 Poitiers Cedex 9</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://gresco.labo.univ-poitiers.fr/</ref>
</desc>
<listRelation>
<relation name="EA3815" active="#struct-5928" type="direct"></relation>
<relation name="EA3815" active="#struct-54493" type="direct"></relation>
<relation name="EA3815" active="#struct-302000" type="direct"></relation>
</listRelation>
<tutelles>
<tutelle name="EA3815" active="#struct-5928" type="direct">
<org type="institution" xml:id="struct-5928" status="VALID">
<idno type="IdRef">026403315</idno>
<idno type="ISNI">0000000121654861</idno>
<orgName>Université de Limoges</orgName>
<orgName type="acronym">UNILIM</orgName>
<date type="start">1968-10-01</date>
<desc>
<address>
<addrLine>33 rue François Mitterrand BP23204 87032 Limoges</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.unilim.fr</ref>
</desc>
</org>
</tutelle>
<tutelle name="EA3815" active="#struct-54493" type="direct">
<org type="institution" xml:id="struct-54493" status="VALID">
<orgName>Université de Poitiers</orgName>
<desc>
<address>
<addrLine>15, rue de l'Hôtel Dieu - 86034 Poitiers Cedex</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.univ-poitiers.fr/</ref>
</desc>
</org>
</tutelle>
<tutelle name="EA3815" active="#struct-302000" type="direct">
<org type="institution" xml:id="struct-302000" status="VALID">
<orgName>Institut Sciences de l'Homme et de la Société</orgName>
<orgName type="acronym">IR SHS UNILIM</orgName>
<desc>
<address>
<addrLine>5 rue Félix EbouéB.P. 312787031 Limoges Cedex 1</addrLine>
<country key="FR"></country>
</address>
<ref type="url">http://www.unilim.fr/SHS-Institut-Sciences-de-l-Homme</ref>
</desc>
</org>
</tutelle>
</tutelles>
</hal:affiliation>
<country>France</country>
<placeName>
<settlement type="city">Limoges</settlement>
<region type="region" nuts="2">Limousin</region>
</placeName>
<orgName type="university">Université de Limoges</orgName>
<placeName>
<settlement type="city">Poitiers</settlement>
<region type="region" nuts="2">Poitou-Charentes</region>
</placeName>
<orgName type="university">Université de Poitiers</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Bulletin de Méthodologie Sociologique / Bulletin of Sociological Methodology</title>
<idno type="ISSN">0759-1063</idno>
<imprint>
<date type="datePub">2012</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="mix" xml:lang="en">
<term>Data entry</term>
<term>EU lobbies</term>
<term>optical character recognition (OCR)</term>
<term>prosopography</term>
<term>recoding data</term>
</keywords>
<keywords scheme="mix" xml:lang="fr">
<term>Lobbys européens.</term>
<term>Prosopographie</term>
<term>Recodage des données</term>
<term>Reconnaissance optique des caractères (OCR</term>
<term>Saisie de données</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Treating "Masses" of Prosopographical Data by Scanning Directories - Hopes and Disorientation: This note aims to provide an update on the progress made in optical character recognition (OCR) and the contribution of these techniques to the creation of prosopographical data bases in social sciences. With the example of a European investigation of European business associations, it highlights the progress made possible by OCR with the analysis of several biographical directories identifying groups of business interests. 8ased on this example, it is hypothesized that the development of digital technologies allow the creation of corpuses of data much larger th an in the past in the framework of quantitative inquiries conducted by smaller teams. However, this article also highlights the fact that this extension of corpuses - made possible by scanning - raises new problems of method, starting with the increased time devoted to the standardization of digital data</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Limousin</li>
<li>Poitou-Charentes</li>
</region>
<settlement>
<li>Limoges</li>
<li>Poitiers</li>
</settlement>
<orgName>
<li>Université de Limoges</li>
<li>Université de Poitiers</li>
</orgName>
</list>
<tree>
<country name="France">
<region name="Limousin">
<name sortKey="Laurens, Sylvain" sort="Laurens, Sylvain" uniqKey="Laurens S" first="Sylvain" last="Laurens">Sylvain Laurens</name>
</region>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000234 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000234 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Hal:halshs-00747762
   |texte=   "Traiter des " masses " de données prosopographiques par la numérisation d'annuaires : entre espoirs et vertiges"
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024